Failure Data Analysis of a LAN of Windows NT based Computers

Authors

  • M. Kalyanakrishnam
  • Zbigniew T. Kalbarczyk
  • Ravishankar K. Iyer
Abstract

This paper presents the results of a failure data analysis of a LAN of Windows NT machines. Data for the study was obtained from event logs collected over a six-month period from the mail routing network of a commercial organization. The study focuses on characterizing the causes of machine reboots. The key observations are: (1) most of the problems that lead to reboots are software related; (2) rebooting the machine does not always solve the problem (after about 60% of the reboots, the rebooted machine reported problems within an hour or two); (3) there are indications of propagated or correlated failures; and (4) although the average availability is over 99%, machine downtime lasts two hours on average. Since the machines are dedicated mail servers, bringing down one or more of them can disrupt the storage, forwarding, reception and delivery of mail. This suggests that average availability is not a good measure for characterizing this type of network service.
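Observation (4) can be illustrated with a small calculation. The sketch below is not the authors' method and uses no data from the paper: the outage intervals, the six-month observation window, and the helper names `availability` and `mean_downtime` are all hypothetical, chosen only to show how an availability figure above 99% can coexist with outages that each last two hours.

```python
from datetime import datetime, timedelta

# Hypothetical (went_down, came_back) intervals for one machine.
# These values are illustrative assumptions, not data from the paper.
outages = [
    (datetime(1998, 1, 10, 2, 0), datetime(1998, 1, 10, 4, 0)),
    (datetime(1998, 3, 5, 13, 30), datetime(1998, 3, 5, 15, 30)),
    (datetime(1998, 5, 20, 8, 0), datetime(1998, 5, 20, 10, 0)),
]

# Assumed six-month observation window.
window_start = datetime(1998, 1, 1)
window_end = datetime(1998, 7, 1)


def total_downtime(intervals):
    """Sum the length of all outage intervals."""
    return sum((back - gone for gone, back in intervals), timedelta())


def availability(intervals, start, end):
    """Fraction of the observation window during which the machine was up."""
    return 1 - total_downtime(intervals) / (end - start)


def mean_downtime(intervals):
    """Average length of a single outage."""
    return total_downtime(intervals) / len(intervals)


if __name__ == "__main__":
    print(f"availability:  {availability(outages, window_start, window_end):.4%}")
    print(f"mean downtime: {mean_downtime(outages)}")
```

With these assumed inputs the availability comes out above 99% while every outage still takes the mail server offline for two hours, which is the gap between the summary metric and the user-visible disruption that the abstract points to.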


Similar resources

Measurement-Based Analysis of System Dependability Using Fault Injection and Field Failure Data

The discussion in this paper focuses on the issues involved in analyzing the availability of networked systems using fault injection and the failure data collected by the logging mechanisms built into the system. In particular, we address: (1) analysis in the prototype phase using physical fault injection into an actual system. We use an example of fault injection-based evaluation of a software-imple...


Event Log based Dependability Analysis of Windows NT and 2K Systems

This paper presents a measurement-based dependability study using event logs collected during about 3 years from 133 Windows NT and 2K workstations and servers interconnected through a LAN. We focus on the identification of machine reboots, the classification of their causes, and the evaluation of statistics characterizing the uptimes, downtimes, and the availability of the Windows NT and 2K ma...


A Statistical Analysis of User-Specific Profiles

This technical report examines common assumptions about computer users in profile-based optimization. We study execution profiles of interactive applications on Windows NT to understand how different users use the same program. The profiles were generated by the DIGITAL FX!32 emulator/binary translator system, which automatically runs the x86 version of Windows NT programs on NT/Alpha computers...


Serving Scientists Worldwide

Editors: Kari Mielikäinen, Harri Mäkinen and Mauri Timonen. Abstracts, WorldDendro 2010: The 8th International Conference on Dendrochronology, June 13–18, 2010, Rovaniemi, Finland. Serving Scientists Worldwide since 1991. More details at www.regentinstruments.com • Fax: 418-653-1357 • REGENT INSTRUMENTS INC. Image Analysis Systems for Plant Science Based on...


Networked Windows NT System Field Failure Data Analysis

This paper presents a measurement-based dependability study of a Networked Windows NT system based on field data collected from NT System Logs from 503 servers running in a production environment over a four-month period. The event logs at hand contain only system reboot information. We study individual server failures and domain behavior in order to characterize failure behavior and explore e...




Publication date: 1999